Towards a Corpus Annotated for Metonymies: the Case of Location Names
Abstract
At the moment, language resources do not contain the necessary information for large-scale metonymy processing. As a contribution, we here present a corpus annotated for metonymies. We describe a framework for annotating metonymies in domain-independent text that considers the regularity, productivity and underspecification of metonymic usage. We then present a fully worked out annotation scheme for location names and a gold standard corpus containing 2000 annotated location names. The annotation scheme is rigorously evaluated as to its reliability and compared to previous metonymy classification proposals. In particular, we show that it is not sufficient to rely on intuitions for reliable metonymy identification and that an annotation effort with trained annotators and explicit guidelines is necessary.
Similar resources
Corpus-based Metonymy Analysis
In this paper we make the case for corpus-based metonymy analysis and show that many interesting linguistic and statistical questions can only be answered by working with real texts. To facilitate such studies, we present a method for annotating metonymies in domain and genre-independent text. We advocate an annotation scheme that builds on regularities in metonymic usage, that takes underspeci...
Logical metonymies and qualia structures: an annotated database of logical metonymies for German
Logical metonymies like The author began the book involve the interpretation of events that are not realized in the sentence (covert events:→ writing the book). The Generative Lexicon (Pustejovsky, 1995) provides a qualia-based account of covert event interpretation, claiming that the covert event is retrieved from the qualia structure of the object. Such a theory poses the question of to what ...
Metonymy Resolution as a Classification Task
We reformulate metonymy resolution as a classification task. This is motivated by the regularity of metonymic readings and makes general classification and word sense disambiguation methods available for metonymy resolution. We then present a case study for location names, presenting both a corpus of location names annotated for metonymy as well as experiments with a supervised classification a...
A New Method for Extracting Named Entities in Classical Arabic
In Natural Language Processing (NLP), developing resources and tools contributes to the breadth and effectiveness of research in each language. In recent years, Arabic Named Entity Recognition (ANER) has attracted the attention of NLP researchers because of its significant impact on other NLP tasks such as machine translation, information retrieval, question answering, query result...
Covert Events and Qualia Structures for German Verbs
Sentences like The author began the book (logical metonymies) involve the interpretation of covert events which are not explicitly realized on the surface (→ The author began writing the book). Qualia-based accounts of logical metonymies (Pustejovsky, 1991, 1995) account for such covert events using complex lexical entities (qualia structures) for the objects. We present a corpus study for the ...